Automatic Language Identification of Telephone Speech
نویسنده
چکیده
II Lincoln Laboratory has investigated the development of a system that can automatically identify the language of a speech utterance. To perform the task of automatic language identification, we have experimented with four approaches: Gaussian mixture model classification; single-language phone recognition followed by language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and language-dependent parallel phone recognition. These four approaches, which span a wide range of training requirements and levels of recognition complexity, were evaluated with the Oregon Graduate Institute Multi-Language Telephone Speech Corpus. Our results show that the three systems with phone recognizers achieved higher performance than the simpler Gaussian mixture classifier. The top-performing system was parallel PRLM, which performed two-language, closed-set, forced-choice classification with a 2% error rate for 45-sec utterances and a 5% error rate for lO-sec utterances. For eleven-language classification, parallel PRLM exhibited an 11% error rate for 45-sec utterances and a 21% error rate for 10-sec utterances.
منابع مشابه
Automatic language identification using large vocabulary continuous speech recognition
We have developed a highly accurate automatic language identification system based on large vocabulary continuous speech recognition (LVCSR). Each test utterance is recognized in a number of languages, and the language ID decision is based on the probability of the output word sequence reported by each recognizer. Recognizers were implemented for this test in English, Japanese, and Spanish, usi...
متن کاملPhonetic Landmark Detection for Automatic Language Identification
This paper presents a method of augmenting shifted-delta cepstral coefficients (SDCCs) with the classification outputs of an array of support vector machines (SVMs) trained to detect a set of manner and place features on telephone speech. The SVM array allows for broad phoneme classification, and when this information is concatenated with SDCCs to form a hybrid feature vector for each acoustic ...
متن کاملPerceptual benchmarks for automatic language identification
There has been renewed interest in the eld of automatic language identiication over the past two years. The advent of a public-domain ten-language corpus of telephone speech has made the evaluation of diierent approaches to automatic language identiication feasible. In an eeort to provide benchmarks for evaluating machine performance, we conducted perceptual experiments on 1-, 2-, 4-and 6-secon...
متن کاملLanguage identification using acoustic log-likelihoods of syllable-like units
Automatic spoken language identification (LID) is the task of identifying the language from a short utterance of the speech signal uttered by an unknown speaker. The most successful approach to LID uses phone recognizers of several languages in parallel [Zissman, M.A., 1996. Comparison of four approaches to automatic language identification of telephone speech. IEEE Trans. Speech Audio Process....
متن کاملComparison of four approaches to automatic language identification of telephone speech
AbstructWe have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian mixture model (GMM) classification; single-language phone recognition followed by languagedependent, interpolated n-gram language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and languaged...
متن کامل